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Abstract 



We present an optimized and physically motivated method for separating 
top quark signal events from background events at the Tevatron. For the 
top quark signal ti —* e//i + 4 jets, we show how to reject all but 25% of 
the background in a data sample while retaining 80% of the signal, without 
introducing bias into the subsequent mass measurement. The technique used 
is the Binary Decision Tree. Combining this highly efficient procedure for 
signal identification with a novel algorithm for top quark reconstruction, we 
propose a powerful new way to measure the top quark mass. 
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The CDF and D0 collaborations recently announced the much-awaited discovery of the 
top quark Both collaborations will next endeavor to study its production and decay 

properties further, and to improve the measurement of its mass. An important aspect of 
the analysis is the need to reject a good fraction of the numerous background events, while 
keeping most of the signal. 

In this Letter, we employ an artificial-intelligence algorithm, the Binary Decision Tree 
||, to discover optimized and physically motivated cuts that discriminate signal from back- 
ground with an efficiency well beyond what is possible using conventional methods [||]. By 
exploiting differences between the signal and background without relying on explicit recon- 
struction of the top quark signal, these cuts moreover introduce no bias into measurement 
of the mass. After presenting the optimized cuts, we propose a new top quark mass recon- 
struction algorithm in which a peak in a selected 3-jet mass distribution reveals t — > jjj and 
provides a direct measurement of m t along with a model-independent measurement of the 
background. With the anticipated integrated luminosity of the current experimental run at 
the Tevatron, there will be enough events not only to see the mass peak clearly, but also 
to observe the subsequent hadronic decay W — > jj, furnishing a new, direct calibration of 
hadronic calorimetry and the jet-finding algorithm. 

In the Standard Model, the top quark decays electroweakly via t — > W + b. The W boson 
in turn decays hadronically to two jets (W + — > jj) approximately 2/3 of the time, and 
semi-leptonically (W + — > e + z/ e , p^v^ t + u t ) in the remaining 1/3. At the Tevatron, top 
quarks are mainly produced in pairs pp — > ti+X. Due to severe QCD backgrounds, reliable 
detection of a top quark pair requires at least one of the two resulting W bosons to decay 
semi-leptonically into e or /i. We will focus on the "single leptonic" signature £+4 jets where 
I = e or fi. These events occur with six times the rate for double-lepton events, and have 
the added virtue of containing only one neutrino, which facilitates the mass measurement. 

The main background to this mode is from the direct production of pp — > W + 4 jets, 
occurring at about two times the signal rate in the Standard Model. To suppress this 
background, one can exploit the fact that two of the 4 jets in the signal are due to 6-quarks 
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which can be tagged with some probability, while 6-jets are rare in the background. Because 
we seek high signal acceptance, we will eschew a 6-tagging requirement, but point out below 
how it can be used, when available, to complement our analysis. 

In the absence of 6-tagging, the weapon of choice for reducing the background is to 
impose cuts in appropriate observables. Consider for example m^-, whose distribution is 
shown in Fig. 1. (m^- is the lowest of the 6 invariant masses formed from pairs of the 4 
jets.) The signal peaks near 75 GeV, while the background (dotted curve) is concentrated 
at low rn^j. Requiring each event to have a minimum observed m^j can thus increase the 
signal/background ratio S/B, without appreciable loss of signal. 

Our first improvement over previous analyses comes from introducing new variables, 
including m^, and showing how the physics of the background and signal makes these 
variables powerful tools for signal enhancement. The major thrust of our work, however, 
is toward obtaining cuts in a set of observables simultaneously. Before describing how the 
Binary Decision Tree determines these highly efficient cuts, we review the conventional route 
to signal vs. background discrimination. 

Based on comparisons of signal and background distributions like Fig. 1, a list of candi- 
date observables is selected. A simple cut specified by x^ > x ijiain and/or Xi < x iimax in each 
variable Xi is arrived at by trial-and-error adjustment, compromising between background 
rejection and signal acceptance. Each cut is relaxed or tightened in turn to roughly opti- 
mize S/ B at the desired level of signal acceptance. The virtue of this procedure is that the 
physical nature of each cut is understandable. For example, the simple cut m^- > 50 GeV 
enhances signal because the background's jets tend to arise from bremsstrahlung, where the 
collinear and soft singularities of QCD give rise to low pair masses. If there are two or more 
variables, however, simple cuts are usually far from optimal. Consider the case of just two 
observables. One could examine the two-dimensional scatter-plot of the signal and back- 
ground to select an Syi^-enhancmg cut. Simple cuts would partition the scatter-plot along 
lines running parallel to the coordinate axes, with events in one or three of the resulting 
quadrants to be accepted and all others rejected. Let us further assume the signal and back- 
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ground distributions are Gaussian. In this case, an optimal cut is generally along an ellipse 
or hyperbola which is the contour of constant S/B, and it cannot be written as one or even 
several simple cuts. Even in the special case where the optimal cut lies along a straight line 
(which happens in the Gaussian case when the signal and background are identical except 
for their centroids), that line is generally not a simple cut, because it need not be parallel to 
a coordinate axis. Furthermore, as Fig. 1 shows, the variables used here are obviously not 
Gaussian ||, so the form of the optimal cut is not apparent. It is unlikely, however, that 
the optimal cut is close to any set of simple cuts. Thus, finding the proper cuts by hand is 
difficult for two variables, and seemingly impossible for more than two variables. 

The neural network approach |6|,[|] offers an alternative for signal/background classifi- 
cation that avoids the restrictive form of simple cuts. It has the unfortunate drawback, 
however, of yielding a "black-box" solution whose cuts are not easy to interpret in physi- 
cal terms. In addition, the "training" of the network to arrive at the final cuts can make 
heavy demands on computer time. Some other algorithms that have been considered, in- 
cluding if -matrix and Probability Density Estimation ||, also efficiently separate signal 
from background, but fail to match the transparency of simple cuts. 

In this paper we advocate instead the Binary Decision Tree ||, which, compared to the 
conventional method, yields much higher signal efficiency. The decision tree has been shown 
to perform at the same level as the neural network in an earlier simple study of the top quark 
signal || , but with the crucial distinction that it yields explicit physically interpretable cuts 
and makes more modest demands on computer horsepower. The basic decision tree was 
described in Refs. |3],|8| . We outline the algorithm in the form implemented in the program 
hastac ||, which has been tailored for use in high-energy physics signal identification. 

Let the set of variables x — (x\, . . . , x n ) define the feature space of events, with each xt 
an observable such as m|,-. A generalized cut in {x\, . . . ,x n ) is the requirement that each 
event satisfy the inequality a ■ (x — x°) > 0, where a = (a\, . . . , a n ) is a vector normalized 
to Y^i Q% — 1- The geometrical interpretation of this expression is clear: the feature space is 
cut in two by a hyperplane passing through the point x°, with the hyperplane orientation 
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specified by its normal a. Simple single-variable cuts are just hyperplanes restricted to 
normals along one coordinate axis of the feature space. The power of the decision tree 
derives from its ability to optimally determine a and x$ for one or more generalized cuts. In 
this paper, we will restrict ourselves to two or three generalized cuts, which are sufficient to 
strongly suppress the background. 

The optimized hyperplane cuts are found by the decision tree as follows ||. Approx- 
imating the step function as 9(X) ~ 0(A) = (1 + e -A / T ) -1 , where T is a relatively small 
number, the number of signal events S\ falling on the "accepted" side of the hyperplane is 
approximated by 

S A (a,x°)=J2®(a-(x(a)-x )) , (1) 

a 

with a sum over all signal events a. B A (a,x°) is defined analogously for the background. 
With S\ and Ba thus transformed into differentiate functions of a and x°, we employ 



conjugate gradient optimization [T(J to maximize 

Q N (a,x°) = S A (d,x°)/[B A (a,x°)] N . (2) 

The parameter N can be chosen to assign primary importance to S/B enhancement (N = 
1 =>- Q = S/B) or to high signal acceptance (N — > =>- Q — > S). The value N = 0.5 =>- 
Q = S/ \f~B makes the optimized function Q equal to the approximate statistical significance 
S/ Ob of the signal, assuming S and B to be Poisson distributed. After optimization, each 
cut is specified by {a, x }, or more concisely by a form a ■ x < c, where c is a number. 
Qualitative interpretation of each cut is through the relative signs of eij, which indicate 
positive or negative correlation in each variable with the likelihood of an event being signal. 

Next, we describe the physical features of the signal and background on which our efficient 
cuts are based. The primary background to the top quark signature £ + 4 jets is the set of 
processes leading to direct production of W + 4 jets. After minimal acceptance cuts given 
below, about 40% of the background is due to qq — ► Wgggg processes. The other major 
sources of background are qg Wgggq, qq — > Wggqq and qg — > Wgqqq (q = quark or anti- 
quark), with contributions ranging from 15% to 30%. The background is thus characterized 



by processes with multiple gluon jets in the final states. The structure of the matrix elements 
dictates that much of the cross-section will lie in regions in phase space close to collinear 
and/or infrared divergences. Near-collinear radiation of jets with respect to the incoming p, p 
leads to jets with low transverse momentum px (due to quark and gluon bremmstrahlung) 
and/or high pseudorapidity rj = — log tan 9/2. Collinear and infrared divergences influence 
gluon bremmstrahlung and splitting, leading to production of q + g or g + g with small 
relative angle and low dijet mass rrijj. The trijet masses rrijjj similarly tend to be low. 

In strong contrast, the large mass of the top quark pair implies that it is produced with 
low velocity (50% of the time with v/c < 0.32 ) at the Tevatron energy y/s = 1.8 TeV. ( The 
bulk of top quark pair production is through qq — * ti, with the next largest contribution 
gg — > ti representing only about 10%.) The velocities of t and i are also small. The two- 
body top quark decay t — > bW + is roughly isotropic in the top rest frame, giving the b 
jet a characteristic maximum transverse momentum scale ~ m t /2. In fact, we find, for a 
top quark of mass 175 GeV, that the 6-jet p T distribution peaks at 52 GeV with average 
71 GeV. The jets from hadronic W decay share the W momentum, so their average px is 
somewhat smaller but still peaks at 32 GeV with average 56 GeV. One expects large trijet 
masses {rrijjj ~ m t ), and also large dijet masses: rrijj ~ m w , or rrijj ~ m t /^/3 in view of 
the kinematic relation m\ 2Z = m\ 2 + + ^23- 

Before making a detailed comparison of signal and background, we list the minimal 
acceptance cuts we impose to simulate detector acceptance, and describe our calculation of 
the signal and background. The acceptance cuts are 

p T (j) > 17.0 GeV, p T (£) > 20.0 GeV, i> T > 25.0 GeV, 

|77(i)|<2.0, |^)l < 2.0, R(j,f)> 0.7, R(j,e)>0A. (3) 

Here R(j,j') = \J (Ai]) 2 + (A0) 2 where A<p and At] are the differences in azimuthal angle 
and pseudorapidity between jets j, f. To simulate detector resolution, the rj and (f) of each 
parton was smeared from its true value by Gaussian random amounts with standard devia- 
tion 0.05 in each. The missing transverse momentum p'x, which is taken as a measurement 
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of Pt, was calculated by smearing each parton energy by a Gaussian random amount with 
a(E)/E = C ' I ' \JE~t where C = 0.6 for jets and 0.15 for £, before calculating the transverse 
momentum imbalance. To simulate the effects of hadronization, we further smeared the jet 
energies so that a(E)/E = 1.0/y/Er- 

We employed helicity amplitude techniques to compute top quark production, keeping all 
top quark and W boson decay correlations. To calculate the background, we used the Monte- 
Carlo package vecbos [0. We used the CTEQ2 set 5 parton distributions Hl2"f , which are 
leading-order fits and hence appropriate for our leading-order calculation. Similarly, we used 
a leading-order form for a s , with Aqcd given by the parton distributions. Factorization and 
renormalization scales were chosen as /jlr = p,p = m t for the signal and (1r = p,p = 
for the background. The background rate in particular has theoretical uncertainties, so its 
direct measurement described below is most welcome. We will discuss the specific case of 
m t = 175 GeV in considerable detail, but also include results for m t = 190 GeV in the 
table for a comparison. These results show that the efficacy of both our cuts and our top 
mass-reconstruction depend very weakly on the true value of m t . 

Assuming the projected integrated luminosity / Cdt = 100pb _1 for "Run I" at the Teva- 
tron, we expect a total of 49 top quark signal events (for m t = 175 GeV), and 116 background 
events, to pass the minimal acceptance cuts @. Thus we begin with S/B ~ 0.42 before our 
discrimination cuts. 

Some variables that we have tried as input to the decision tree program hastac are 
ordered versions of the observables discussed above. The jet transverse momenta are Pr(j) > 
... > Pt{j)- The jet pseudorapidities are l^ 1 ^)] > ... > |^ 4 (j)|. The dijet masses are 
m], > . . . > m% and the triiet masses are mj,,- > . . . > m% 4 . 

33 33 J 333 333 

Even before application of the decision tree, several of these variables point out significant 
differences between signal and background. In the signal, one pair of jets comes from the 
decay of a W, so the minimum dijet mass m^,- is less than mw except for smearing effects 
and the W width. The pair masses otherwise tend to be large, so as shown in Fig. 1, the 
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signal climbs steadily with to a peak near mw, after which it drops sharply. In contrast, 
the background falls quickly from its largest value at m^- pa 2p™ m (j) = 34 GeV. A simple 
cut m®j > 40 GeV passes (S,B) = (43,50.2) events. A tighter cut could even raise S/B 
above one: > 56 GeV passes (S, B) = (30.8, 18.2) events. The power of this variable 
reflects the qualitative differences between signal and background described above. 

Similarly, the distribution in lowest trijet mass m|- ■ for the signal rises to a peak near 
m t and then falls sharply because the signal cannot have a minimum trijet mass above m t 
(modulo jet resolution and width of the top quark). Meanwhile the background distribution 
falls steadily with rn A ---. The simple cut m|- - > 120 GeV would accept (S,B) = (42.0,44.0) 
events. However, unlike the distribution, a tighter cut would not yield any further 
significant enhancement in S/B. It is interesting to note that m|- - by itself could serve as a 
crude but effective method to directly detect a top quark mass peak above a smoothly falling 
background, without recourse to any fitting procedure or assumptions about the value of 
rrif 

As expected from the infrared-enhancement in the QCD background, the minimum jet 
transverse momentum p T (j) also distinguishes well between signal and background. The 
cut p T (j) > 25 GeV keeps (S, B) = (35.2, 36.2) events. An extremely tight cut of p T (j) > 
35 GeV will raise S/B to more than 2, at the cost of signal acceptance, with (S,B) = 
(17.2,8.2) events. We note in passing that D0 employed a related variable, the scalar sum 
of the jet transverse momenta H T = J2i \PtU)\- A high signal acceptance cut in H T is as 
good as PtU) — taking Ht > 210 GeV, the events passing this cut are (S, B) = (37.4, 39.4) 
- but no amount of tightening the cut on H T will obtain S/B significantly over 1. 

The three observables just discussed, m^-, m^j, and p T (j), are the most powerful dis- 
criminators we have found, as judged by their solo performances. We used them as input 
to the hastac optimization, in concert with four additional variables whose individual dis- 
tributions do not so clearly separate signal from background, but which prove useful in 
correlation with the first three. We should note that several other observables (e.g., the 
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other rrijj, m 1 ^^ and p l T (j)) are similarly helpful, so that our choice of variables was dictated 
largely by taste and the ease in interpreting the final cuts. 

The first two of these four additional variables are the two largest jet pseudorapidities 
|t7 4 (j)| and |r/ 3 (j)|, which complement p T (j) in recognizing jet radiation that is collinear with 
the incoming beams. A third is the W transverse momentum Pt(W), which peaks toward 
~ m t /2 in the signal and tends to be smaller in the background. The fourth added observable 
is the maximum trijet mass m^j, which can serve to close a "high- mass" loophole for the 
background: though most dijet masses in the background tend to be low, some very high 
masses can be generated when two gluons, for example, are radiated off opposite incoming 
beams. Because of the acceptance cut on PtU), there is usually another jet with sufficient 
PtU) t° combine with the pair to produce a large trijet mass in addition to the large dijet 
mass. 

To follow the conventional route at this point, one would make cuts in several of these 
variables simultaneously, and by trial- and-error adjust the cuts for the best discrimination. 
That route would not only be laborious; it would also totally miss any useful correlations 
between the variables, because it permits only "rectangular" cuts. We therefore presented 
the variables to hastac for automatic generation of efficient generalized cuts. We detail our 
generalized cuts in Table 1. Because two of the cuts involve variables of different dimension, 
we scale all momenta and masses by my/ for convenience. We compare the background with 
a signal for m t = 175 GeV in the following, but note that very similar results for m t = 190 
GeV are indicated in Table 1. 

The first generalized cut (a) drastically shrinks the background by simultaneously requir- 
ing high PtU), m %ji m % an d Pt(W), precisely as anticipated in the discussion above. The 
advantage of generalized cuts shows up in the extra 25% decrease of background relative to 
cuts in any one variable for the same signal efficiency. The second cut (b) attempts to close 
the high-dijet mass loophole, while at the same time requiring more centrally located jets. 
The cuts (a-b) pass 79% of the signal, but only 17% of the background, giving S/B almost 
as high as the tight cut in p T {j) described above, but with twice the signal acceptance. This 
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set of high-acceptance cuts (a-b) will serve as the starting point for our reconstruction of 
the top quark mass. But first we comment on the more stringent third and fourth cuts. 

These two cuts function similarly to cut (b), but have much lower signal acceptances. 
They are intended only for the sake of illustrating how an even higher S/ B can be obtained 
without explicit top quark reconstruction (though the latter clearly may also be used to 
increase S/B). Indeed, in the more extreme case (cuts (a-b) and (d)), the signal/background 
ratio is almost 4, which is unattainable through any of the variables taken individually. It is 
also interesting to note (bearing in mind that we have not included full hadronization and 
detector effects) that the more moderate set of cuts (a-c) is comparable in both S/B and 
signal acceptance to that achieved by CDF through 6-tagging. From the above interpretation 
of these cuts, it is likely that they are fairly complementary to 6-tagging. Assuming a single 
6-jet tagging efficiency of 40% for the signal, the other 60% of the events passed by cuts 
(a-c) would represent sizeable signal acceptance otherwise rejected by 6-tagging. DO, on the 
other hand, could use these cuts alone to match the previous background rejection of CDF, 
despite their lack of a silicon vertex detector. Finally, we remark that although we have 
discussed above only m t = 175 GeV, Table 1 shows that all of the cuts have almost identical 
effect on a signal with m t = 190 GeV, which reflects the relatively small dependence on the 
exact value of m t for which the cuts were optimized. 

We next present a new top quark reconstruction algorithm that, applied to events passing 
the high acceptance cuts (a-b), can measure m t directly. Our first key observation is that 
the measurement should be based on the hadronic decay t — > bqq, since the rather poor 
measurement of the neutrino momentum significantly degrades the mass resolution for t — > 
blv. Our goal is to form a histogram of rrijjj for 3-jet systems that are tagged as coming 
from t — > bqq, using the t — * blv mass only for the tagging, i.e., to recognize which of the 
four jets came from the leptonically decaying top, leaving the leftover trio as the hadronic 
decay. The location of the peak in rrijjj will measure m t (with Monte Carlo needed only to 
assess instrumental effects). The backgrounds from QCD and from incorrect jet assignment 
will be directly measured in a model-independent way by fitting the histogram. This is 
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important because it allows for the possibility that leading-order models of the background 
such as VECBOS may be quite unreliable. 

Unlike other analyses, we do not attempt to fully reconstruct the event by trying to 
identify which pair of the three jets in the hadronic decay came from the W. This keeps 
the "combinatoric problem" under control, since it cuts down the possible jet assignments 
from 12 per event to just 4. Also, since we treat the 3 jets in t — > jjj symmetrically, at the 
end of the analysis we can plot a histogram of dijet pair masses from t — > jjj candidates 
(3 combinations per event) and, without reconstruction-induced bias, observe the W — > jj 
peak in it. This will give an important independent calibration of jet energy measurement 
and jet-finding algorithms. 

Our partial reconstruction is carried out as follows. For each event that passes the S/ B 
enhancement cuts (a-b), we assign each of the four jets in turn to go with the lepton. Let 
rrijjj be the invariant mass of the remaining three jets. We select the assignment if (1) 
120 GeV < rrijjj < 240 GeV; (2) \rrij£ V — m trial | < 20 GeV; (3) \rrijt v — m trial | is the smallest 
of the four possibilities that pass (1) and (2). We took the trial top quark mass m tria i = 175 
GeV, but show below that this choice affects only the height, and not the location, of the 
mass peak in rrijjj. In practice, of course, a range of m trial may be swept to optimize the 
signal peak. The mass range for rrijjj is kept very broad, so there is ample room to separate 
peak from background. The mass range for was chosen to keep ~ 70% of the true 

signal. We have checked that this algorithm does not produce fake peaks due to either the 
QCD or combinatoric backgrounds. 

Measurement of the neutrino momentum is crucial for measurement of rrij£ v . The trans- 
verse momentum of v is taken to be the negative of the total pr observed in the calorimeter, 
giving it an uncertainty due to the uncertainties of all four jet pt's added in quadrature; 
plus contributions from inaccurate measurement of the many low p T particles in the event, 
the possibility of other neutrinos (e.g., from semi-leptonic decays in one or both 6-jets), and 
instrumental effects due to gaps in the detector coverage. The longitudinal momentum of 
the neutrino can be computed from rri£ V = raw, with a two- fold ambiguity in addition to 
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uncertainties due to the width of the W and the error in pj>. That computation is usually 
expressed by a quadratic equation for p v L , but it is much clearer to think of it as follows. 
The invariant mass mg u is given by 



By assuming mt v = mw one determines cosh(7/„ — r\i) and hence \r\ v — T)e\. The two-fold 
solution ambiguity is due to the undetermined sign of rj v — rjf the two solutions for r\ v lie on 
either side ofrje and equidistant from it. There will be considerable uncertainty in \rj£ — f] u \ 
due to errors in p v T and <p v , the finite W width, and because cosh(^£ — r/ u ) is usually close to 
1, where mp v is rather insensitive to rji — rj u . It can even happen (~ 20% of the time) that 
there is no solution, in which case rj u = rji is the best guess. When there are two solutions, 
we choose the sign of r\£ — r\ v to be that of r]£ (the solution with the smaller W energy), which 
most of the time is correct at the Tevatron, since the W's are produced rather centrally in 
rapidity due to the limited total energy. Even for the ~ 22% of events where the wrong 
solution is chosen, this rule is often adequate since the two solutions are often close to each 
other, since we only need the neutrino momentum to compute muv which is not always very 
sensitive to r) v , and since we only need m Uu measured accurately enough to tag the correct 
one of the four jets.[] 

Fig. 2 shows the resultant plot for rrijjj, with m t = 175, 190 GeV. We have plotted only 
the events passing cuts (a-b) with \mj£ u — m tr i a i| < 20 GeV, m tr iai = 175 GeV. This includes 

1 At much higher energies, such as at the LHC, there is no clear way to choose the correct neutrino 
solution in order to evaluate rriji v . For such a case, one can avoid choosing by instead using m* iu 
which is defined by minimizing rrij£ u with respect to n v . To an excellent degree of approximation, 
that is equivalent to assigning n v to the px weighted average: n* = {p^Vt + Pt^'V^t +Pt)- 
The "Jacobian Peak" in the amount of phase space near the minimum causes a sharp peak in the 
probability distribution for m* iu at a value only a slightly lower than the true peak in w,ji v . The 
quantity m* iL/ is analogous to the "transverse mass" variable used in measuring mw- 




(4) 
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32.6/38.6 signal events for m t = 175 GeV, 18.2/25.5 signal events for m t = 190 GeV, and 
only 13.7/19.8 background events. The resulting clear peak has suffered almost no shift 
away from m t , despite simulated detector smearing effects and, importantly, non-optimal 
choice of m tr iai in the case of m t = 190 GeV. (The peak for m t = 190 GeV increases by 10% 
if m tr iai = 190 GeV is used, but its location is unchanged). This result provides verification 
that our method, by relying totally on uiji v for the trijet selection, avoids introducing bias 
into the trijet mass. 

A nice cross-check of a top quark peak found using this method is shown in Fig. 3, where 
for each trio of jets in the peak, each of the three dijet mass combinations is plotted (with 
weight 1/3 each). A clear peak at my/ appears, which will provide a unique calibration for 
the hadronic calorimetry and the jet-finding algorithm. We also note that the combinatoric 
background under the W peak is substantial, which shows the wisdom of not trying to 
recognize W — > jj as part of the tt event selection. 

In conclusion, we have demonstrated the usefulness of the Binary Decision Tree technique 
in separating signal and background events for top quark production at the Tevatron. We 
showed why the new observables m^-, Tn^j, and pj<(j) strongly enhance the signal. We 
introduced an algorithm to determine the top quark mass, which yields a directly observable 
t — > bqq peak in a certain rrijjj distribution. We further showed S/B ~ 4 is achievable (for 
m t = 175 GeV), with only about 50% loss of the signal beyond typical minimal experimental 
acceptance cuts. 

Finally, we point out that the methods derived here for tt — > £ + 4 jets could be used in 
an analogous fashion to observe the total hadronic signature tt — > 6 jets [jn|. We expect a 



similar substantial increase of S/B through HASTAC-derived cuts. Given the higher event 
rate, further background suppression by requiring one 6-tagged jet would greatly reduce B 
but leave sufficient signal events. In analogy to the tag oni-> b£u, we would pick from the 5 
other jets the pair that 1) reconstructs a W boson and 2) best reconstructs t — ► jjj with the 
tagged b. Then the invariant mass of the other 3 jets should have an unbiased peak at m t . 
Work in this direction, as well as refinements of our method such as inclusion of 6-tagging 
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information for tt — > £ + 4 jets, and consideration of events with less than 4 observed jets is 
in progress. 
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FIGURES 

FIG. 1. rrijj in 2 GeV bins; the dashed curve is the background, the solid curve the sum of 
signal (nit = 175 GeV) and background, with acceptance cuts only at an integrated luminosity of 
jCdt = 100 pb- 1 . 

FIG. 2. The reconstructed top quark mass, rrijjj, in 8 GeV bins, with cuts (a-b) and the 
requirement that \m^ v — m tr i a i| < 20 GeV, for m tr i a i = 175 GeV. The solid curve indicates the 
sum of signal, with nit = 175 GeV, and background; the dashed curve gives the sum of signal, with 
vrit = 190 GeV, and background; the dotted curve gives the background alone. All are presented 
for JCdt = 100 pb" 1 . 

FIG. 3. Mass distribution (3 combinations per event) for dijets formed from hadronically 
decaying tops (m ( = 175 GeV) identified using the top mass reconstruction algorithm (cuts (a-b), 
\mji v — mtriai | < 20 with m tria i = 175 GeV, and \m,jjj — m t \ < 15 GeV) at / Cdt = 100 pb" 1 . The 
W boson mass peak, which is not used in the analysis, shows up clearly. 
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TABLES 

TABLE I. Effect of HASTAC-derived generalized cuts on signal and background, in events at 
/ Cdt = 100 pb -1 . The percentage of events relative to acceptance cuts only is given in parentheses. 



Cuts Signal (m t = 175 GeV) Signal (m t = 190 GeV) Background 





Acceptance Cuts Only 


49.0 (100%) 


32.3 (100%) 


116.0 (100%) 




(a) 


40.8 (83%) 


28.5 (88%) 


29.2 (25%) 




(a-b) 


38.6 (79%) 


25.5 (79%) 


19.8 (17%) 




(a-b) and (c) 


30.0 (61%) 


20.4 (63%) 


10.0 (9%) 




(a-b) and (d) 


22.4 (46%) 


16.5 (51%) 


6.0 (5%) 


(a) 


(0.76j40') + °- Um jjj + 0.65 m^- + 0.13 PT (W))/m w > 1.0 




(b) 


0A3m1jjj/mw — 


Q.32m] j /m w + 0.089 |r/ 4 (j)| + 0.097 \rf{j)\ < 1.0 




(c) 


(- 


-0.13/7^ + 0.797^ 


)/m w > 1.0 




(d) 


-0.31m), 


jj/mw + 1-4 mljjj/mvi 


r - 0.63|r/ 4 (j)| > 1.0 
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